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Abstract 

Background: Hybridization can have connplex effects on evolutionary dynannics in ants because of the 
combination of haplodiploid sex-determination and eusociality. While hybrid non-reproductive workers have been 
found in a range of species, examples of gene-flow via hybrid queens and males are rare. We studied hybridization 
in East African army ants {Dorylus subgenus Anommo) using morphology, mitochondrial DNA sequences, and 
nuclear microsatellites. 

Results: While the mitochondrial phylogeny had a strong geographic signal, different species were not recovered 
as monophyletic. At our main study site at Kakamega Forest, a mitochondrial haplotype was shared between a 
"Dorylus molestus-Wke" and a "Dorylus wilverthi-Wke" form. This pattern is best explained by introgression following 
hybridization between D. molestus and D. wilverthi. Microsatellite data from workers showed that the two 
morphological forms correspond to two distinct genetic clusters, with a significant proportion of individuals being 
classified as hybrids. 

Conclusions: We conclude that hybridization and gene-flow between the two army ant species D. molestus and D. 
wilverthi has occurred, and that mating between the two forms continues to regularly produce hybrid workers. 
Hybridization is particularly surprising in army ants because workers have control over which males are allowed to 
mate with a young virgin queen inside the colony. 

Keywords: Dorylinae, Formicidae, introgression, microsatellites, mtDNA, gene flow 



Background 

While botanists have long accepted that hybridization 
plays an important role in plant evolution and regularly 
leads to the emergence of new species [1], zoologists 
have traditionally regarded hybridization and interspeci- 
fic gene flow as rare exceptions [2]. However, the advent 
of molecular genetic markers has changed this view dur- 
ing the last decade and it is now widely accepted that 
hybridization between closely related species is also 
common in animals [3-5]. Hybrids are often non-viable 
or sterile due to negative epistasis and therefore tend to 
be rather efficiently removed from the population by 
natural selection [6]. However, in situations where this 
is not the case, hybridization can lead to the collapse of 
closely related species into a single panmictic population 
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[e.g. [7]], or to speciation events when some form of 
reproductive isolation between parental species and 
hybrids emerges [3-5]. 

In haplodiploid eusocial animals such as ants, the con- 
sequences of hybridization can be strikingly different 
from those in other organisms. First, a diploid queen 
that has mated with a heterospecific haploid male will 
still produce purebred sons via arrhenotokous partheno- 
genesis, so that hybrid males will only be produced in 
the F2 generation. Second, potentially negative conse- 
quences of hybrid sterility can be mitigated or even 
avoided when hybrid individuals mostly or exclusively 
become non-reproductive workers [8,9]. Hybrid workers 
do indeed regularly occur in a variety of ant species, 
whereas hybrid queens and males are normally not 
observed [10-12]. Hybridization in ants can therefore 
lead to interesting evolutionary novelties, such as genetic 
caste determination in interdependent lineages of Pogo- 
nomyrmex harvester ants, where purebred females 
become queens and interlineage hybrids become 
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workers [13,14]. Another recent example is a population 
of Formica wood ants with two clearly distinct male 
genepools and hybrid queens [15]. This stable system is 
maintained by strong transmission ratio distortion with 
respect to the sex of the offspring. In both cases there is 
no gene flow between the parental populations. 

In the present study we investigated possible hybridi- 
zation between different species of African swarm-raid- 
ing army ants, with particular focus on Kakamega 
Forest, Kenya, which is one of the few sites where the 
two closely related species, Dorylus (Anomma) molestus 
and Dorylus (Anomma) wilverthi, occur in sympatry. 
African swarm-raiding army ants (the "driver ants") are 
a prominent feature in afrotropical forests, where they 
are prime invertebrate predators with colony sizes of 
over ten million individuals. They form a well-supported 
clade within the Dorylus subgenus Anomma [16]. Like 
in other Hymenoptera, females (queens and workers) 
are diploid, while males are haploid and are produced 
by arrhenotokous parthenogenesis. Army ant queens, 
unlike the queens of most other ants, are permanently 
wingless, never go on a mating flight, and do not found 
new colonies independently. Instead, large colonies pro- 
duce a reproductive brood of several thousand winged 
males and just a few virgin queens. Colony fission 
occurs when the mother queen and a large fraction of 
the worker-force emigrate from the nest and leave the 
developing reproductive brood and the remaining work- 
ers behind. All but one of the virgin queens are elimi- 
nated in yet unknown ways. After the males have 
eclosed, they disperse on the wing to mate with a young 
queen in another colony. Virgin queens, on the other 
hand, never leave their natal colony and mate with typi- 
cally 10 - 30 foreign males that enter the colony from 
outside [17]. Because colonies and their queens only dis- 
perse on foot, gene flow is highly male-biased [18,19]. 

An interesting consequence of this idiosyncratic 
reproductive system is that males have to "run the 
gauntlet of the workers" before they can mate, so that 
workers have ample opportunities to choose the mates 
of their queen-sister [20]. If her later breeding success 
is affected by some of her many matings being hetero- 
specific, the workers are under selection to influence 
the outcome of this mating process because the result 
affects their inclusive fitness. This implies that docu- 
mentation of hybridization and introgression in army 
ants provides direct working hypotheses about the fit- 
ness of mixed genotypes. Dorylus colonies are headed 
by a single queen (monogynous), but due to the high 
queen-mating frequency, most workers in a colony are 
half-sisters, i.e. they have the same mother queen but 
different fathers [17]. 

Previous work showed that the two species at Kaka- 
mega occupy clearly distinct ecological niches, with D, 



wilverthi being essentially restricted to intact rainforest, 
while D. molestus also occurs in savannah habitats and 
open agricultural landscapes [21-23]. Our preliminary 
morphological assessments suggested that both popula- 
tions at Kakamega are aberrant when compared to allo- 
patric populations of the same species, indicating that 
hybridization and introgression may have taken place. 
That hybridization between closely related army ants 
may occur is also consistent with the observation that 
heterospecific males are occasionally found in Dorylus 
{Anomma) colonies [24], and with the description of the 
"hybrid variety" D. sjoestedti var. sjoestedti-wilverthi [25]. 

Here we use a combination of morphological, mito- 
chondrial and nuclear genetic data to show that hybridi- 
zation and introgression have occurred historically 
between different species of swarm-raiding army ants at 
a large geographic scale in East Africa, and that ongoing 
hybridization can be detected in Dorylus workers from 
Kakamega. Our results allow novel inferences about the 
evolutionary relevance of hybridization in social insects. 

Results 

(a) Morphological Analysis 

The D. wilverthi'like workers from Kakamega (see Fig- 
ure 1 for geographic positions of study sites) had the 
posterior angles of the head prolonged into a raised 
point, but not recurved outwards as in specimens from 
more western populations (Figure 2a). More than half of 
the examined D. molestus-hke workers lacked the char- 
acteristic petiolar tubercles. Workers of both forms at 
Kakamega had relatively shorter antennal scapes than 
the workers from the respective "pure" allopatric popu- 
lations (Figure 2b). The best common regression model 
for the morphometric data combined was SL = 0.3039 
HW + 0.6994 (SL = antennal scape length, HW = maxi- 
mum head width). Differences between the relative resi- 
duals between all four groups were highly significant 
(Kruskal-Wallis statistic = 280.7, p < 0.0001, Dunn's 
Multiple Comparison Test in all comparisons p < 0.01). 

(b) Phylogeographic Analysis 

The phylogenetic analysis of mitochondrial haplotypes 
resolved several well supported clades (Figures 3 and 4, 
Table 1). Most strikingly, neither D. molestus nor D. wil- 
verthi were resolved as monophyletic. Instead, the mito- 
chondrial phylogeny reflects geographical patterns rather 
than species affiliations: all samples from the coastal 
area of Kenya and Tanzania south of the Lower Tana 
River form a well-supported clade, extending inland to 
the southern slope of Mt. Kenya (clade 3 in Figures 3 
and 4). The same is true for samples from Ethiopia and 
two of the more northern samples from Kenya and 
Uganda (clade 4). Samples north of and around Lake 
Victoria form another big clade (clade 2), which includes 
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Figure 1 Collection sites for Dorylus {Anommo) samples in East 
Africa. White = D. molestus, black = D. wilverthi, grey = D. terrificus. 
The dots for three main study sites are plotted in larger size. The 
three West and Central African sites (see Table 1) where samples of 
D. sjoestedti, D. nigricans, and D. enneryi were collected are given on 
the inset map. 



haplotypes from all three species that occur in the area. 
All common haplotypes from Kakamega form a mono- 
phyletic group (clade 1), which is nested within the lar- 
ger Lake Victoria group (clade 2). 

(c) Population Structure and Divergence 

We assessed the population structure and divergence 
among three populations of D. molestus (Tana River, 
Mt. Kenya, Kakamega Forest) and one D. wilverthi 
population (Kakamega Forest). None of the pair-wise 
tests for genotypic disequilibrium between microsatellite 
markers was significant (all p > 0.05). Pair-wise esti- 
mates of genetic differentiation between four analysed 
populations at nuclear microsatellites {Fst) are given in 
Table 2. Estimates of Pgr between the D, wilverthi-hke 
and the D. molestus populations were slightly but con- 
sistently higher than estimates between the different D. 
molestus populations (Table 2; one-tailed t-test: t = 
-2.33, df = 4, p = 0.04). 



The mitochondrial haplotype diversity was much 
lower in the D, wilverthi-hke population {Na = 2; = 
1.55; N = 34) than in the D. molestus populations (Kaka- 
mega: Na = 7; Rs = 6.24; N = 42; Mt. Kenya: Na = 8; 
= 6.94; N = 29; Tana River: Na = 5; Rs = 5.00; N = 11). 
Interestingly, the D. wilverthi-hke population was close 
to fixation (allele frequency = 0.97) for a haplotype that 
was also common in the sympatric D. molestus-hke 
population at Kakamega (allele frequency = 0.19) and 
was closely related to the other local D. molestus haplo- 
types (Figures 3 and 4). At the same time this is the 
only haplotype that is shared between any of the four 
populations. The second rare haplotype found in D. wil- 
verthi from Kakamega is closely related to the common 
haplotype and differs only by a single base-pair (Figure 
4). 

Replicate runs in STRUCTURE were highly consistent 
in every analysis. The highest likelihood scores were 
associated with a model with four subdivisions {k = 4, 
mean In L over five replicate runs = -3185.36; 0.23 SD; 
for comparison: k = 2, -3409.86; 0.11 SD; k = 3, 
-3245.90; 0.28 SD; all k > 4 had lower average likeli- 
hoods than /c = 4), although a model with five subdivi- 
sions had very similar likelihood scores {k = 5, -3185.56; 
0.81 SD). The four clusters corresponded well to the 
four populations, although a significant proportion of 
individuals had reasonably high likelihoods of ancestry 
in a population other than their source population (Fig- 
ure 5; Table 2). By far the strongest uncertainty over 
individual assignments was found between the D. wil- 
verthi-hke and D. molestus-hke populations at Kakamega 
Forest, as well as between the Mt. Kenya and Tana River 
populations of D. molestus. When set to divide the sam- 
ples in two groups {k = 2), STRUCTURE consistently 
recovered one cluster consisting of the two Kakamega 
populations and another cluster consisting of D. moles- 
tus samples from Tana River and Mt. Kenya, rather 
than combining the three D. molestus populations. At k 
= 3, STRUCTURE split the two Kakamega populations, 
and at /c = 5 the Tana River + Mt. Kenya samples gener- 
ally formed three clusters, while the two Kakamega clus- 
ters were retained (results not shown). 

To quantify the proportion of hybrid workers at Kaka- 
mega, we arbitrarily classified individuals with an 
inferred proportion of ancestry < 0.9 in the correct 
source population as hybrids. Each hybrid individual 
was then determined to be "shared" between its source 
population and the population in which it had the high- 
est proportion of ancestry among the non-source popu- 
lations. The proportions of individuals that are "shared" 
between each pair of populations are given in Table 2. 
According to this estimate, 18.2% of the genotyped 
workers from Kakamega Forest were hybrids. If we 
lower the cut-off value for the inferred proportion of 



Kronauer et al. Frontiers in Zoology 201 1, 8:20 
http://www.frontiersinzoology.eom/content/8/1/20 



Page 4 of 1 3 



B 




2 4 



Ui 

c 

0) 

_l 

CD 

c 
c 

0) 




2 mm 



[ J £inni 




♦wilverthi 
Amol&stu^ 



1.3 



-T 

Z5 



3 5 



4.5 



Maximum Head Width 



Figure 2 Morphological variation among army ant populations. (A) Photographs of D. wilverthi (left) and D. molestus. Arrows indicate 
diagnostic "horns" in D. wilverthi and petiolar tubercles in D. molestus. Photographs are courtesy of April Mobile and AntWeb at http://www. 
antweb.org. (B) The relationship between antennal scape length (SL) and maximum head width (HW) in workers of "pure" allopatric D. molestus, 
"pure" allopatric D. wilverthi, sympatric D. molestus-Wke individuals from Kakamega, and sympatric D. wilverthi-Wke individuals from Kakamega. 



ancestry from 0.9 to 0.75, still 7.3% of the Kakamega 
workers were hybrids (pairwise comparisons between all 
populations not shown). 

Results from a GENECLASS2 analysis were fully con- 
sistent with the results obtained from STRUCTURE. 
Five D. wilverthi'like individuals were assigned to the D. 



molestuS'like population from Kakamega (the D. moles- 
tuS'like population from Kakamega had the highest 
probability of being the source population among the 
four potential source populations), and one D, molestus- 
like worker from Kakamega was assigned to the D. wil- 
verthi'like population. Similarly, two D. molestus 
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Figure 3 Maximum lil^elihood phylogram showing relationships between mitochondrial haplotypes of East African Dorylus {Anomma) 
swarm-raiding army ants. Each label gives the haplotype number (see Table 1), the abbreviated species name (Dem: D. emeryi; Dmo: D. 
molestus (green); Dni: D. nigricans; Dsj: D. sjoestedti (red); Dte: D. terrificus (grey); Dwi: D. wilvertlii (blue)), the sample locality, and the sample size 
in parentheses for haplotypes that were sampled more than once. Numbered bars on the right refer to clades that are discussed in the text. 
Node labels represent posterior probabilities from Bayesian analyses {*: 0.95 - 0.99; **: 1.0)/maximum likelihood bootstraps (*: 0.75 - 0.9; **: > 0.9). 
No internal labels or a hyphen indicate support values < 0.95 and < 0.75 for the Bayesian and maximum likelihood analysis, respectively. The 
haplotype that is shared between the D. nnolestus-Wke and the D. iv/7\/ert/i/-like forms at Kakamega Forest is highlighted in a red box. 
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Figure 4 Parsimony networic of all East African haplotypes of 

D. molestus, D. wilverthi and D. terrificus. A connecting line 
between liaplotypes represents one mutation and small open 
circles represent missing (inferred) haplotypes. Green haploytypes 
are from D. molestus, blue haplotypes from D. wilverthi, and grey 
haplotypes from D. terrificus. Size of haplotype disks is proportional 
to their frequencies. Boxes with numbers indicate the same 
haplotype clusters as shown in Figure 3. 



workers from Mt. Kenya were assigned to the Tana 
River population, and six workers from Tana River were 
assigned to the Mt. Kenya population. No other cross- 
assignments were found. 

Discussion 

(a) Morphological analysis 

The morphological and morphometric data show that 
two forms of swarm-raiding army ants can be distin- 
guished at Kakamega Forest. These two forms strongly 
resemble D. molestus and D. wilverthi from allopatric 
populations, but are also significantly different from 
"pure" D, molestus and D, wilverthi with intermediate 
phenotypes for several key characters (Figure 2). This 
suggests that both forms have arisen from hybridization 
between D, molestus and D, wilverthi at Kakamega For- 
est. At the same time, gene-flow has not been sufficient 
to completely collapse the two species into a single 
homogeneous hybrid population. 

(b) Mitochondrial phylogeny and introgression 

While mitochondrial haplotypes are clearly structured 
according to geographic origin (Figures 3 and 4), differ- 
ent Dorylus {Anomma) species are not generally recov- 
ered as monophyletic. Below we argue that this finding 
is best explained by the combined effects of unrecog- 
nized cryptic diversity within D, molestus and a complex 
history of mitochondrial introgression in the Lake Vic- 
toria region. D, molestus haplotypes tend to cluster into 
several well supported shallow clades, which are con- 
nected to other such clades by deeper branches (Figure 
3). Furthermore, the Central African D, sjoestedti sam- 
ples cluster with D, molestus samples from coastal 
Kenya and Tanzania, a finding that cannot be explained 
by introgression due to the large geographic distances 
and the fact that more western samples of D. molestus 
do not have haplotypes similar to those of D. sjoestedti. 
Incomplete lineage sorting seems similarly unlikely as 
an explanation (see below), so that this overall pattern is 
suggestive of unrecognized cryptic diversity within D, 
molestus. 

The fact that identical or very similar haplotypes are 
shared between different species in spatially close popu- 
lations around Lake Victoria, on the other hand, points 
towards a complex history of mitochondrial introgres- 
sion in this narrower geographic area. The most striking 
example is found at Kakamega Forest, where the D, wil- 
verthi'hke form is close to fixation for a haplotype that 
is also frequent in the D, molestus-hke form. This haplo- 
type is nested within the clade of Kakamega D, molestus 
haplotypes (clade 1 in Figure 3). The clade of mitochon- 
drial haplotypes from Kakamega Forest is in turn nested 
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Table 1 List of unique mitochondrial COII haplotypes included in this study 


Haplotype no. 


GenBank acc. no. 


Species 


Locality 


N colonies 


1 


GQ999016 


D. wilvefthi 


Kakamega Forest, Kenya 


33 


1 


GQ999023 


D. molest us 


Kakamega Forest, Kenya 


8 


2 


GQ999019 


D. wilvefthi 


Kakamega Forest, Kenya 


1 


3 


GQ999017 


D. molest us 


Kakamega Forest, Kenya 


6 


4 


GQ999020 


D. molestus 


Kakamega Forest, Kenya 


5 


5 


GQ999022 


D. molestus 


Kakamega Forest, Kenya 


7 


6 


GQ999018 


D. molestus 


Kakamega Forest, Kenya 


12 


7 


GQ999021 


D. molestus 


Kakamega Forest, Kenya 


3 


8 


GQ999024 


D. molestus 


Kakamega Forest, Kenya 


1 


8 


GQ999043 


D. molestus 


Mt. Elgon, Kenya 


1 


9 


GQ999042 


D. molestus 


Mt. Kenya (West), Kenya 


1 


10 


EF413797 


D. molestus 


Mt. Kenya (East), Kenya 


1 


1 1 


GU065701 


D. molestus 


Mt. Kenya (East), Kenya 


2 


12 


GU065703 


D. molestus 


Mt. Kenya (East), Kenya 


4 


13 


GU065698 


D. molestus 


Mt. Kenya (East), Kenya 


5 


14 


GU065699 


D. molestus 


Mt. Kenya (East), Kenya 


7 


15 


GU065702 


D. molestus 


Mt. Kenya (East), Kenya 


2 


16 


GU065704 


D. molestus 


Mt. Kenya (East), Kenya 




17 


GU065700 


D. molestus 


Mt. Kenya (East), Kenya 




18 


GQ999037 


D. molestus 


Mt. Kenya (South), Kenya 




19 


GQ999050 


D. molestus 


Tana River (East), Kenya 




20 


GQ999052 


D. molestus 


Tana River (East), Kenya 




21 


GQ999049 


D. molestus 


Tana River (West), Kenya 




22 


GQ999032 


D. molestus 


Tana River (West), Kenya 




23 




l~) mnip'^ti 1^ 


Tana River (West), Kenya 




24 


GQ999027 


D. molestus 


Addis Abeba, Ethiopia 




25 


GQ999028 


D. molestus 


Bale Mts., Ethiopia 




26 


GQ999040 


D. wilvefthi 


Budongo Forest, Uganda 




27 


GQ999035 


D. wilvefthi 


Budongo Forest, Uganda 




28 


GQ999031 


D. molestus 


Semliki, Uganda 




29 


GQ999053 


D. molestus 


Mt. Lole, Kenya 




30 


GQ999045 


D. molestus 


Mt. Warges, Kenya 




31 


EF413804 


D. teffificus 


Kibale Forest, Uganda 


5 


32 


EF413798 


D. wilvefthi 


Kih?ilp Fnrp^t I )n?inH?i 


2 


33 


GQ999034 


D. wilvefthi 


Mabira Forest, Uganda 


2 


34 


GQ999030 


D. molestus 


Kalinzu, Uganda 




35 


GQ999033 


D. molestus 


Nakuru, Kenya 




36 


GQ999039 


D. teffificus 


Mgahinga, Uganda 




37 


GQ999038 


D. molestus 


Ngong Hills, Kenya 




38 


GQ999046 


D. molestus 


Kilimanjaro, Tanzania 




39 


GQ999047 


D. molestus 


Kilimanjaro, Tanzania 




40 


GQ999048 


D. molestus 


North Pare, Tanzania 




41 


GQ999029 


D. molestus 


Taita Hills, Kenya 




42 


GQ999044 


D. molestus 


Taita Hills, Kenya 




43 


GQ999041 


D. molestus 


Arabuko Sokoke, Kenya 




44 


GQ999025 


D. molestus 


Shimba Hills, Kenya 




45 


GQ999026 


D. molestus 


Gombe, Tanzania 




46 


EF413795 


D. sjoestedti 


Ndoki, Republic of Congo 




47 


GQ999054 


D. sjoestedti 


Salonga, DR Congo 




48 


EF413803 


D. fiigficafis 


Tai, Ivory Coast 




49 


EF413773 


D. emefyi 


Tai, Ivory Coast 




Haplotypes that were sampled In different species (haplotype 1) or In different populations of the same species (haplotype 8) are listed 


twice. 
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Table 2 Genetic differentiation between three 
populations of D.molestus and one sympatric population 
of D. wilverthi 







D. molestus 




D. wilverthi 




1. Mt. Kenya 


2. Tana River 


3. Kakamega 


4. Kakamega 


1. 




0.05 


0.11 


0.14 


2. 


48.9 




0.12 


0.16 


3. 


1.9 


4.1 




0.14 


4. 


3.8 


1.0 


18.2 





Above diagonal: Pair-wise genetic differentiation measured as Fst- All 
estimates are significantly different from zero (p < 0.05) after standard 
Bonferroni correction for multiple comparisons. Below diagonal: The 
percentage of individuals that were classified as hybrids (individuals with an 
inferred proportion of ancestry < 0.9 in the correct source population) and 
"shared" between each pair of populations according to the STRUCTURE 
analysis (see main text for details). 

within a larger clade of haplotypes from the Lake Vic- 
toria region (clade 2 in Figure 3), which contains addi- 
tional haplotypes from D. molestus, D, terrificus, and D, 
wilverthi. This pattern indicates a complex history of 
repeated mitochondrial introgression between different 
pairs of species. 

The combination of two main factors makes it difficult 
to determine the original source species of the Lake Vic- 
toria haplotypes: 1. We presently lack haplotype infor- 
mation for populations of D. wilverthi and D. terrificus 
that have clearly not undergone mitochondrial intro- 
gression, i.e. we are missing "true" D, wilverthi and D, 
terrificus haplotypes in our analysis. 2. Mitochondrial 
DNA undergoes frequent selective sweeps, and recurrent 
sweeps are likely to mask previous introgression events 
[26,27]. 

During Pleistocene and Holocene glacial cycles 
[28,29], the Lake Victoria region has seen recurrent 
expansions and retractions of forest and savannah habi- 
tat [30-32], so it is safe to assume that the distributional 
ranges of rainforest specialists like D. wilverthi and 



savannah/forest generalists like D. molestus have chan- 
ged accordingly. The two species probably broadly over- 
lapped during glacial periods of maximum forest 
expansion, when Kakamega Forest was connected to 
Ugandan rainforests [30], while D, wilverthi has been 
restricted to forest islands during interglacial periods as 
is the case at present. Given this complex recent biogeo- 
graphic history, it will also be challenging to associate 
the observed cases of introgressive hybridization with 
discrete historic events. 

It is interesting to note that only a single mitochon- 
drial haplotype is shared between the two forms at 
Kakamega, while essentially all other haplotypes are 
restricted to the D. molestus 'like form. One possible 
explanation is that mitochondrial gene-flow at Kaka- 
mega goes back to a single or very few events, which by 
chance involved only that particular haplotype. Alterna- 
tively, the pattern could be explained by selection. For 
example, all other D. molestusAike haplotypes, or poten- 
tial cytoplasmic factors coupled to those haplotypes (like 
Wolbachia infections [33]), might have epistatic incom- 
patibilities with parts of the D, wilverthi nuclear gen- 
ome. That the reduced mitochondrial diversity in the D, 
wilverthi 'like form at Kakamega is simply due to a 
recent genetic bottleneck seems unlikely because diver- 
sity at nuclear microsatellite loci is not reduced com- 
pared to other populations (Table 3). 

Another noteworthy case of apparent mitochondrial 
introgression was found in the population from Tana 
River, where haplotypes from both sides of the river 
tend to cluster in clearly distinct clades (Figures 3 and 
4) although, according to microsatellite data (see below), 
all samples belong to a single population. This suggests 
that at least one side of the river has captured haplo- 
types from a more distantly related D. molestus clade. 
This strong genetic structure at the maternally inherited 
mitochondrial locus, and a lack thereof at biparentally 




D. \Niiverthi-\\ke Kakamega D. molestus Idim River 

D. molestus-WVjB Kakamega D. molestus Mt. Kenya 



Figure 5 Assignment of individuals to four populations using the program STRUCTURE without a priori assumptions. Shown are 
assignments of individuals from the D. wilverthi-Wke population at Kakamega Forest, the D. molestus-like population at Kakamega Forest, and the 
pure D. molestus populations at Tana River and Mt. Kenya. The y-axis represents the proportion of each multilocus genotype that is attributable 
to each of the four populations. Samples are grouped along the x-axis according to their population of origin. 
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inherited nuclear loci, stems from the fact that the per- 
manently wingless army ant queens cannot cross water 
barriers, while the winged males readily do so [18]. 

Alternative explanations for a lack of monophyly 
between closely related species usually involve the reten- 
tion of ancestral polymorphisms or incomplete lineage 
sorting. However, these mechanisms are less likely to 
apply to mtDNA as compared to nuclear DNA because 
ancestral polymorphisms will be lost more quickly due 
to genetic drift given the smaller effective population 
size of mtDNA and because mtDNA in insects evolves 
significantly faster than nuclear DNA (e.g. [34]). On the 
other hand, mtDNA seems to be particularly prone to 
introgression, at least in species where males are the 
heterogametic or haploid sex (e.g. [34-37]). One reason 
for this is that in such species males tend to suffer more 
from hybridization than females (Haldane's rule; [38]), 
so that gene-flow will be more restricted at nuclear loci 
relative to maternally inherited mitochondrial loci. 
Furthermore, if incomplete lineage sorting occurs at 
random it should not produce an obvious correlation 
with geographic distribution (e.g. [39]), as we observed 
in East African Dorylus {Anomma) species. Under 
incomplete lineage sorting, the minimum divergence 
time between sequences is given by the time elapsed 
since speciation [40]. In the absence of gene flow, muta- 
tions should therefore accumulate quickly in cytochrome 
oxidase II haplotypes, which have high mutation rates of 
ca. 1.5% per million years in insects (e.g. [41]). Given 
the strong geographic signal in our mtDNA phylogeny 
(Figures 3 and 4), the general lack of overlapping haplo- 
types between neighbouring conspecific populations, 
and the fact that sympatric species at Kakamega share 
identical haplotypes for fast evolving mitochondrial 
genes, we conclude that introgressive hybridization 
between army ant species is clearly the best explanation 
for the observed haplotype distribution in the Lake Vic- 
toria area. 

(c) Differentiation and hybridization at nuclear DNA 

The two morphological forms at Kakamega Forest can 
be clearly distinguished based on the nuclear microsatel- 
lite loci. This again shows that species boundaries have 
not become completely blurred, which would be 
expected if gene flow had been significant over extended 
periods of time. On the other hand, despite genetic dif- 
ferentiation among species being consistently higher 
than between conspecific populations (Table 2), a large 
proportion of workers at Kakamega could not be 
unequivocally assigned to one of the two forms by either 
STRUCTURE or GENECLASS2 (Figure 5). At the same 
time, pair-wise assignments between populations were 
far less problematic, even in conspecific comparisons. 
The only exception was the Tana River/Mt. Kenya 



comparison, which showed very weak differentiation 
making assignments difficult. The reason for this genetic 
similarity may well be that these two populations were 
connected via contiguous gallery forests along the Tana 
River until the recent past. Overall, this implies that a 
considerable proportion of workers at Kakamega Forest 
are hybrids between D, molestus and D. wilverthi. 

Conclusions 

This study provides conclusive evidence that introgres- 
sive hybridization has occurred between different Dory- 
lus (Anomma) species. This is one of the few cases of 
documented gene flow between species of eusocial 
Hymentoptera ([11]; but see [42,43] for other recent 
examples) and shows that reproductive isolation 
between species of East African swarm-raiding army 
ants is incomplete. The frequent occurrence of hybrid 
workers at Kakamega means that the two local forms 
still mate regularly today. However, it is important to 
keep in mind that hybrid workers do not necessarily 
translate into gene flow, because social insect workers 
normally do not reproduce. This also applies to Dorylus 
{Anomma) [19,44]. The general notion that hybrid 
workers are common between several pairs of closely 
related ant species, while hybrid queens and males are 
extremely uncommon [11], might explain why the two 
forms at Kakamega have retained their distinct mor- 
phology and ecological niche space, instead of having 
collapsed into a single homogeneous hybrid population. 
Hybridization in army ants is particularly surprising 
because, unlike in most social insects where mating 
occurs outside the nest, workers have control over 
which males obtain access to a virgin queen. Therefore, 
if hybridization would imply reduced colony fitness, 
workers should be under strong selection to recognize 
heterospecific males and prevent them from mating. 
This suggests that hybridization in army ants entails lit- 
tle or no fitness costs or might even be adaptive under 
certain circumstances. Future work will have to investi- 
gate the stability of the hybrid zone at Kakamega over 
time, and the effects of hybridization on the actual gen- 
otypes of Dorylus queens and males, to clarify the pre- 
cise pattern of hybridization and the amount of ongoing 
gene-flow between the two sympatric forms. 

Materials and methods 

(a) Sampling and morphological analysis 

To study patterns of mitochondrial differentiation 
between species and populations of East African Dorylus 
[Anomma) army ants, we collected 151 colony samples 
(one individual per colony) for mitochondrial DNA 
sequencing from 24 localities, representing four mor- 
phologically distinguishable species, D. molestus, Dorylus 
sjoestedti, Dorylus terrificus, and D. wilverthi (Figure 1; 
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Table 1). A single West African sample of Dorylus nigri- 
cans was also included. A haplotype of the West African 
species Dorylus emeryi served as outgroup, as this spe- 
cies belongs to the sister clade of the swarm-hunting 
Anomma "driver ants" [16]. 

Samples for microsatellite genotyping (one worker per 
colony) were collected from four populations at three 
different sites in Kenya: Kakamega Forest (where D. 
molestuS'like and D. wilverthi-like forms are sympatric; 
N = 55 each), the eastern slope of Mt. Kenya {D. moles- 
tus; N = 50), and Lower Tana River {D. molestus; N = 
42). Subsets of these samples were also sequenced for 
mitochondrial DNA (see Table 1). The Mt. Kenya popu- 
lation is geographically situated halfway in between the 
two other populations (Figure 1). All samples were col- 
lected and stored in ethanol. 

We identified samples morphologically using the diag- 
nostic criteria given in the original species descriptions 
and in [45]. Furthermore, samples were directly com- 
pared with type specimens. D. molestus, D. terrificus, 
and D. sjoestedti, which currently have the status of sub- 
species of D, nigricans, were treated as species according 
to [45,46], and the results of an ongoing taxonomic revi- 
sion of the Dorylus {Anomma) swarm-raiding army ants 
(C. Schoning et al. unpublished). Importantly, D. wil- 
verthi and D. molestus workers are easily distinguished: 
D, wilverthi workers have the posterior angles of the 
head uniquely prolonged into a raised, slightly outwardly 
recurved point ("horns"; Figure 2a), while D, molestus 
workers have posteroventral extensions ("tubercles") on 
the petiole (Figure 2a) and significantly shorter appen- 
dages than D, wilverthi [47,48]. To quantify morphologi- 
cal variation we measured antennal scape length (SL) in 
relation to maximum head width (HW) for large work- 
ers (HW > 1.5 mm) in four groups: D. molestus-like 
individuals from Kakamega (N = 54 workers from 10 
colonies), D. wilverthi-hke individuals from Kakamega 
(N = 45 workers from nine colonies), "pure" allopatric 
D. molestus (N = 157 workers from 31 colonies belong- 
ing to 20 populations across the species' range), and 
"pure" allopatric D. wilverthi (N = 85 workers from 16 
colonies belonging to 10 populations across the species' 
range). Measurements were taken with a MS5 Leica 
stereomicroscope fitted with an ocular micrometer. Data 
were analysed following Schoning et al [47]. 

(b) DNA extraction, amplification, sequencing, and 
microsatellite genotyping 

DNA for sequencing was extracted from 1-2 worker legs 
using standard QIAGEN® (DNeasy®) and MACHEREY- 
NAGEL (Nucleo Spin®Tissue) kits. A mitochondrial 
fragment of the cytochrome oxidase II (COII) gene was 
amplified and sequenced using primers tRNALeu [16] 
and Barbara [49] as has been described earlier [16]. 



Sequencing reactions were either performed in house or 
purified PGR products were sent to a commercial 
sequencing facility (Macrogen, Korea). The final aUgn- 
ment used in this study consisted of 548 bp of COII 
sequence. GenBank accession numbers and details for 
sequenced samples are given in Table 1. 

DNA for microsatellite genotyping was extracted by 
boiling 1-2 worker legs in 100 (il of 5% Chelex 100 (Bio- 
Rad). A total of 202 individuals were genotyped at five 
microsatellite loci (DmoB, DmoC, DmoD, DmoG, 
DmoO) as has been described previously [50]. 

(c) Phylogeographic Analyses 

Phylogenetic analyses of mitochondrial sequences were 
conducted in a Bayesian framework using the program 
MrBayes 3.1.2 [51,52], and in a maximum likelihood fra- 
mework using the program GARLI 0.96 [53]. We imple- 
mented a general time reversible model with gamma- 
distributed rate heterogeneity (GTR+G), which was 
selected as the most appropriate model for our data by 
both hierarchical Likelihood Ratio Tests and the Akaike 
Information Criterion in MrModeltest 2.3 [54]. 

To assure convergence of Markov Chain Monte Carlo 
runs in MrBayes we repeated the analysis from two 
independent starting trees. One cold and three heated 
chains were run in parallel for 20'' 10^ generations and 
trees were sampled every 100 generations. Average stan- 
dard deviations of split frequencies were consistently < 
0.01 after 13'' 10^ generations, indicating that the inde- 
pendent runs had converged. Accordingly, the first 
130,000 trees from each run were discarded as burn-in. 
Based on this sample of 140,002 trees from both runs 
combined, all potential scale reduction factors for model 
parameters were < 1.02, indicating effective sampling 
from the posterior probability distribution. A consensus 
phylogram with posterior probabilities based on both 
runs was calculated in MrBayes. 

We initially performed three independent runs in 
GARLI setting the number of generations after which 
the run is terminated if no new significantly better scor- 
ing topology has been found to 30,000, and otherwise 
using default parameter settings. We then performed a 
run with 1,000 bootstrap replicates setting the above 
parameter at 10,000. 

A mitochondrial haplotype network for all East Afri- 
can haplotypes of D. molestus, D. wilverthi and D. terri- 
ficus was constructed using a statistical parsimony 
approach [55] implemented in TCS version 1.21 [56]. 
The connection limit in the TCS program was set at 
unlimited. 

(d) Population Structure and Divergence 

The number of alleles per microsatellite locus {Na)> 
expected heterozygosity (Z/^), allelic richness corrected 
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Table 3 Number of alleles (Na), expected heterozygosity (Hs), allelic richness (Rs), and inbreeding coefficient (F/s) 
estimated for five microsatellite loci and three populations of D.molestus and one sympatric population of D. wilverthi 

D. molestus D. wilverthi 



Mt. Kenya Tana River Kakamega Kakamega 

(N = 50) (N = 42) (N = 55) (N = 55) 



Locus 


Na 


Hs 


Rs 


F,s 


Na 


Hs 


Rs 


F,s 


Na 


Hs 


Rs 


F,s 


Na 


Hs 


Rs 


F,s 


DmoB 


4 


0.56 


3.82 


-0.18 


3 


0.55 


3.00 


0.09 


5 


0.65 


4.97 


0.05 


7 


0.55 


6.84 


0.04 


DmoC 


4 


0.57 


4.00 


0.07 


5 


0.68 


5.00 


0.28 


5 


0.69 


5.00 


0.11 


4 


0.52 


3.75 


0.23 


DmoD 


5 


0.77 


5.00 


-0.05 


8 


0.85 


8.00 


-0.04 


8 


0.81 


7.49 


0.02 


7 


0.74 


6.67 


0.14 


DmoG 


11 


0.86 


10.46 


0.19* 


12 


0.86 


11.98 


0.11 


9 


0.82 


8.49 


-0.02 


6 


0.72 


6.00 


-0.04 


DmoO 


9 


0.86 


8.97 


0.26* 


8 


0.71 


7.95 


0.07 


9 


0.77 


8.99 


-0.04 


9 


0.79 


8.67 


0.10 


mean 


6.6 


0.72 


6.45 


0.06 


7.2 


0.73 


7.19 


0.10 


7.2 


0.75 


6.99 


0.02 


6.6 


0.66 


6.39 


0.09 



F/s estimates significantly different from zero (p < 0.05 after standard Bonferroni correction for multiple comparisons) are marked with an asterisk. 



for sample size (Rs), and the inbreeding coefficient Fjs 
were estimated for each population using the program 
FSTAT 2.9.3.2 [57]. Fjs was tested for significant devia- 
tion from zero using 400 randomisations to assess 
Hardy Weinberg equilibrium. The same program was 
used to test for genotypic disequilibrium between all 
pairs of microsatellite loci within each population 
(Table 3). Furthermore, we used FSTAT to calculate 
pair-wise Fst as an estimate of genetic differentiation at 
microsatellite loci between populations and species. 
FSTAT was also used to calculate Na and Rs for mito- 
chondrial haplotypes. 

We used two different model-based Bayesian methods 
to study hybridization between D. molestus and D, wil- 
verthi at Kakamega. First, the clustering method of the 
program STRUCTURE 2.2 [58] was used to infer popula- 
tion structure at microsatellite loci without prior informa- 
tion on species assignments based on sample locality and 
morphology. The data from all four populations were 
combined in this analysis. To assure convergence and con- 
sistency of results, we conducted five replicate runs with 
different random seeds for each /c, the number of subdivi- 
sions, ranging from one to 10, under an admixture model 
with correlated allele frequencies [59]. Markov chains 
were run for 10^ generations and the first 10^ generations 
were discarded as burn-in. Estimates of hybrid frequencies 
were based on individual probabilities of having ancestry 
in populations other than the source population. The 
comparisons between conspecific populations were used 
as a control to assure that the analyses had sufficient 
power in assigning individuals to different clusters given 
the observed levels of genetic differentiation between 
populations. More recent and potentially ongoing gene- 
flow between conspecific populations should generally 
result in more ambiguous assignments of individuals to 
populations as compared to genetically isolated heterospe- 
cific populations, irrespective of whether these occur in 
sympatry or not. We used the program DISTRUCT 1.1 
[60] to graphically display STRUCTURE results (Figure 4). 



Second, we used the assignment methods in the pro- 
gram GENECLASS2 [61] to estimate the likelihood of 
origin for each individual in each potential source popu- 
lation. All four populations were analyzed in a single 
dataset. To calculate likelihoods, we chose a Bayesian 
criterion with a Dirichlet prior distribution for allele fre- 
quencies [62] using the resampling method of Paetkau 
et al, [63]. The number of simulated individuals for 
probability computation was set at 10,000. 
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